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Abstract — Because of limited output swing, the basic structure 
of a gain-boosted cascode amplifier (GBCA) cannot be used to de- 
sign a low-voltage operational amplifier. In this paper, we investi- 
gate the design of a high-swing GBCA, and find the optimum band- 
width for the boosting or feedback amplifier. This bandwidth elimi- 
nates the slow-settling component in the step response, prevents the 
gain-boosting loop from being unstable, and results in the shortest 
settling time. Finally, we present a very high-speed and high-swing 
amplifier suitable for low-voltage applications. 
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I. Introduction 

HIGH-performance, low -power amplifiers capable of oper- 
ating at a supply voltage of 1.8 V or below are required 
as building blocks of portable mixed-signal systems. For fast 
and complete charge transfer, amplifiers in switched-capacitor 
circuits have to meet two requirements: high-speed and high 
open-loop gain. However, optimizing amplifiers for speed and 
gain leads to contradictory demands. The high-gain requirement 
leads to multistage designs with long-channel devices biased at 
low current levels, whereas the high-speed requirement calls for 
a single-stage design with short-channel devices biased at high 
current levels [1]. 

The gain-boosting technique [Fig. 1(a)] has enabled circuit 
designers to exploit the advantages of single-stage amplifiers 
with adequate gain. In the technique, the output resistance and 
the gain of the main cascode amplifier are increased consider- 
ably using a feedback amplifier (FA). However, if the FA is not 
properly designed, a low-frequency pole-zero frequency dou- 
blet will appear in the transfer function [1] of the main ampli- 
fier. Even though the doublet does not noticeably affect the fre- 
quency response, it introduces a very slow-settling component 
in the settling time, which is undesirable for many applications 
[2]. 

The proper design procedure of a gain-boosted cascode 
amplifier (GBCA) has been a subject of study during the past 
years [3]-[6]. All the methods and design rules presented thus 
far [3]-[6] are suitable for the basic GBCA structure shown in 
Fig. 1(b). However, the circuit has a limited output swing with 
low supply voltages. The limited output swing problem can be 
explained as follows: from Fig. 1(b), the lower end of the output 
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Fig. 1 . (a) Enhancing the gain of a cascode stage using gain-boosting method, 
(b) Basic structure of a GBCA. 
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Fig. 2. High-swing GBCA using (a) a folded cascode FA, and (b) a voltage 
level shifter for FA. 



swing is limited by Vqs3 + ^Z)S2,sat> which is about 0.8 V 
in a typical 0.18-/mi CMOS process. Since, the gain-boosting 
method, for being effective, should also be used for the current 
source Ib , a similar limitation exists for the upper end of the 
output swing. As a result, the allowable voltage swing at the 
output node is about 0.2 V with a 1.8-V supply voltage, which 
is not satisfactory for most applications. Hence, the previously 
proposed optimization methods are not useful in low-voltage 
circuit design. 

There are some other GBCA structures for which the FA 
does not restrict the output swing (Fig. 2) [7], [8]. However, 
these structures suffer from a stability problem. In the struc- 
tures shown in Fig. 2, the FA has at least two poles in its transfer 
function, i.e., a low-frequency pole at node Fl and a high-fre- 
quency pole at node F2. Since, in addition to those poles, there 
is another pole at node X, there are at least three poles in the 
gain-boosting loop consisted of the FA and transistor Mi- This 
situation makes the gain-boosting loop very susceptible to in- 
stability. 

In this paper, we study the optimum design criteria for 
designing a high-swing GBCA suitable for low-voltage appli- 
cations. To the best of our knowledge, this is the first paper 
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Fig. 3. Small-signal model of GBCA. 

dealing with the optimum design of these amplifiers. The or- 
ganization of this paper is as follows: in Section II, we study the 
small signal behavior of a GBCA and develop a good generic 
model for a GBCA. Using this model, the criteria for designing 
a speed-optimized GBCA are presented in Section III. Sec- 
tion IV presents a very high-speed and low-voltage folded 
cascode amplifier, and Section V concludes. 

II. Modeling 

In order to find the proper design procedure for the GBCA, we 
need to find the closed-loop transfer function of the GBCA and 
analyze the GBCA time response based on the derived transfer 
function. 

FA structures used in high-swing GBCAs are either a two- 
pole amplifier or can be modeled as a two-pole amplifier. Hence, 
we have modeled the FA with a two-pole transfer function given 
by 

a(S) 



(*+*)( 



s 

Pf-2 



0+^H 1 



s 

P F2 



(1) 

where Ppi denotes the dominant pole of the FA, and Pp 2 rep- 
resents the other nondominant poles and zeros of the FA. clq 
denotes the open-loop gain of the FA and cu u is the gain-band- 
width product (GBW) of the FA equal to aoPp\. 

Fig. 3 shows the small-signal equivalent circuit of a GBCA. 
From the figure, the open-loop transfer function of the GBCA, 
A(S), can be calculated as (2), shown at the bottom of the 
page. In (2), g m , g ds and <7 m [, denote transconductance, output 
conductance, and body-effect transconductance of a MOS tran- 
sistor, respectively. Cx and Cp are total capacitance seen at 
node X and output, respectively, and Gp is the output conduc- 
tance of the current source Ig. 

The feedforward paths created by Cgdi and Cqd2 produce 
a RHP zero at Z\ = 3Vni/CGDi and a LHP zero at Z 2 = 
<?m2/CGD2> respectively. These zeros, however, are at very high 
frequencies, and their effects can be ignored on the circuit be- 
havior. 

Since a(S) is a two-pole transfer function, the numerator of 
A(S) is of second order, i.e., two zeros, and the denominator of 
A(S) is of forth order, i.e., four poles in the open-loop transfer 



function of the GBCA. Although the calculated A(S) is accu- 
rate, it is complicated and must be simplified for ease of use in 
finding the optimum criteria. 

In order to obtain some feeling for the small-signal behavior 
of this amplifier, the frequency location of the poles and zeros 
was numerically evaluated by considering a representative ex- 
ample. This was achieved by implementing and solving A(S) 
with MATLAB [9], from which we were able to analyze the in- 
fluence of the FA on the movement of the poles and zeros of the 
GBCA. 

This numerical approach of the open-loop transfer function 
behavior is illustrated in Fig. 4(a) where the GBW of the FA, u> u , 
is the parameter under consideration. Fig. 4(a) clearly shows that 
at a low oj u , there are two individual poles and two pole-zero 
frequency doublets in the open-loop transfer function. Obvi- 
ously, the output pole of the GBCA, Pi, is at much lower fre- 
quencies rather than other poles and zeros, and its frequency can 
be approximated by P 1 = .feifl^/aoflVniflV^Cx- 

A low-frequency pole-zero doublet pair, Z\ and P 2 , is lo- 
cated in the vicinity of the uj u , a high-frequency pole-zero dou- 
blet pair, Z2 and P4, in the vicinity of the Pp 2 , and a high-fre- 
quency individual pole P 3 in the vicinity of the second pole of 
the main cascode amplifier. Indeed, P3, when cu u is very low, is 
equal to the second dominant pole of the main cascode amplifier 
located at node X, which we refer to as Px- Px is given by 

3Vn2+flVn62 3Vn2(l + T]) 



Px 



C x 
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(3) 



where 7? = g m b2Jg m 2- 

Increasing lj u repels the low-frequency doublet to higher fre- 
quencies until it merges with P3 to form a complex-conjugate 
pole pair. However, before merging with P3 the pole and zero 
of the low-frequency doublet exactly cancel each other at two 
points, PI and P2. We have demonstrated that the value of uj u 
at these two points can be calculated by 

gds2(l + v) 9dsi{l + v) ,,, 

wm = -p, , wd2 = -p, • (4) 

Between PI and D2, the doublet is very compressed and the 
frequency of its zero Z\ is greater than the frequency of its pole, 

P 2 . 

Fig. 4(a) also shows that over-increasing lj u pushes the com- 
plex-conjugate pole pair, created by merging the low-frequency 
doublet with P 3 , to the RHP of the complex plane, which makes 
the amplifier unstable. 

Although the low-frequency doublet has no significant effect 
on the frequency behavior of the GBCA, it introduces a slow- 
settling component in the transient response[2]. The slow com- 
ponent has different effects on settling time to different error 
bands. As a typical example, Fig. 4(b) shows the settling time 
to 0.1% and 0.01% error bands in the step response of the same 
amplifier discussed in Fig. 4(a) connected in a unity-gain feed- 
back configuration. The figure is created by MATLAB simula- 
tion of the system with open-loop transfer function described by 
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(C X C L S 2 + g m2 {G L + C L S)a(S) + ((g dsl + g ds2 + g mh2 + g m 2)C L S + (g ds2 + G L )C X S) 
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Fig. 4. (a) Pole-zero frequency locations, and (b) 0.1% and 0.01% settling 
times of an instance GBCA versus the GBW of the FA, n>„; for a = 200, 
C L = 10 pF, C x = 3 pF, g ml = 4 mS, g m2 = 2.4 mS, g mb2 = 0.4 mS, 
g dsl =100 /iS, g ds2 = 50 /iS, G L = 0.01 /iS, and P F2 = 3G rad/s. 



(2). From the figure we can see that the doublet has no consid- 
erable effect on the settling time to 0.1% error band, but it de- 
grades the settling to 0.01% error band drastically. This means 
that slow-settling component falls inside the 0.1% error band. 

By increasing w u , at points Dl and D2 where pole and zero 
of the doublet cancel each other out, two minimum points occur 
in the 0.01% settling time. Although the doublet is very com- 
pressed between those points, it still considerably slows down 
the 0.01% settling time. 

It may seem that the value of lo u at points Dl and D2, i.e., 
u>£,i and u)£>2, are suitable bandwidths for the FA, however, at 
these points, the settling time is very sensitive to the value of uj u . 
Besides, regarding (4), the value of uj u at these points is pro- 
portional to the output conductance g,j s of transistors M\ and 
Mi. Since the output conductance g^ a of a transistor in a typ- 
ical MOS technology is subject to large changes due to process 
variations, a designer is not able to exactly adjust w u equal to 
w m and oj D2 . 

Concluding, if we design a wide-band FA, the complex-con- 
jugate pole pair is pushed to the RHP of complex plane and the 
gain-boosting loop will be unstable, on the other hand, if we 
design a low-bandwidth FA, the pole-zero doublet drastically 
slows down the precise settling times. In addition, designing 
a FA with a bandwidth equal to lod\ or wjyi does not solve 
the problem because of the dependency of utjji or wd2 on the 
output conductance of transistors Mi and Mi. Thus, choosing 
a suitable bandwidth for the FA is very critical, especially in 
high-swing GBCA structures. 

Fig. 4(b) clearly shows that by further increasing uj u , another 
minimum point, _D3, occurs in the 0.01% settling which, al- 
though is not as low as the other two minimum points, Dl and 



D2, it is less sensitive to the value of lj u . Hence, this point is 
the optimum value for u> u . As we can see by comparing Fig. 4(a) 
and (b), at point D3, the doublet is replaced by a complex-con- 
jugate pole pair. 

Equation (2) is too complicated to be used for deriving the 
optimum criteria. As a result, we have to simplify it to be able 
to find the optimum bandwidth for the FA. Based on the obser- 
vation we made in the previous section, we do following sim- 
plifications in A(S) and mention their impacts. 

1) The FA transfer function at high frequencies can be well 
approximated by [8] 



<x(S) 



LU„ 



S^ + T^k 



(5) 



2) 



This simplification results in erroneous calculation of the 
dominant pole frequency of A(S), P\, which is of no 
important consequence to the settling time. 
The transconductance of a MOS transistor is much 
greater than its output conductance; i.e., g m 3> g^. 
This simplification results in erroneous calculation of 
wpi and wd2> but this is also not important, because we 
already know that the optimum value of u> u is not around 
ojdi or lj D2 . 
Considering the above simplifications in A(S) and assuming 
a unity-gain feedback, the closed-loop transfer function of the 
GBCA, H(S), is given by (6) shown at the bottom of the next 
page. By dividing the numerator and the denominator of (6) by 
C X C L and recalling that P x = (g m 2 + g m b2)/C x , and gain- 
bandwidth product of the whole amplifier is given by GBW = 
fl'mi/Ci [8], (6) can be rearranged to (7), shown at the bottom 
of the next page. If we normalize (7) to the GBW of the GBCA 
by the help of following equations: 



. a+v) 

GBW 
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F2 



Xn- 



GBW 



F2n- 



GBW 



GBW 



(8) 



we are able to convert (7) to a generic model for the GBCA 
given by (9) shown on the next page. Obtaining (9) enables us 
to easily consider the impacts of the important FA parameters, 
such as Pp2 and u> u , on the settling time of the GBCA 

III. Optimum Design Criteria 

In order to find the optimum design criteria for a GBCA, we 
should find an analytical expression for the settling time from 
the normalized closed-loop transfer function, H n (S). However, 
because of the potentially transcendental nature of the derived 
closed-loop transfer function, an analytical expression for the 
settling time can not be generally derived. The desperate ap- 
proach is to sweep all parameters existing in H n (S) and find 
the best relation between those parameters giving the shortest 
settling time. 

Three parameters, uj un , Px n and Pf2u, are in H n (S). We 
have swept them in the practical range of interest and found the 
resulting settling time using MATLAB simulations. The Px n 
is the ratio of Px, i.e., the second pole of the main cascode 
amplifier, over GBW, i.e., the gain-bandwidth product of the 
main amplifier. As a result, it can represent the phase margin 
(PM) of the main cascode amplifier. It can also represent the 
PM of the GBCA, because the FA has no significant effect on 
the frequency response [3]-[6]. The interest range for Pxn is 
from 1.5 to 3.6, which corresponds to a PM of 60° to 75°. 
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In addition, since the constraints for the design of the FA are 
fewer than those for the main cascode amplifier, P F2 can be 
potentially at very high frequencies compared to GB W and Px ', 
as a result, we have swept Pfiu from 2 to 10. 

By comparing the curves shown in Fig. 5, we can make the 
following observations: 1) pushing Px and Pp2 to higher fre- 
quencies reduces the minimum achievable settling time. Having 
Pxn < 2.6 and Pp2n < 5 is not suitable for designing ampli- 
fiers with short 0.01% settling times. Besides, at a low Pxm 
the sensitivity of settling time to u) un , at optimum u) un , is rela- 
tively high. 2) If P Xn > 2.6 and P F2n > 5, then the FA band- 
width, u> u , should be almost the same as that of the main cas- 
code amplifier (sometimes a greater lo u can have a better result, 
but increasing uj u is power consuming; hence, we suffice to set 
w u = GBW). 3) If P Xn > 2.6 and P F2n > 5, then the normal- 
ized settling time at optimum lo u is almost between 10 and 12. 
Since all frequencies are normalized to GBW, settling times are 
normalized to GBW -1 . 



As a result of above observations, a simple design procedure 
for designing a GBCA with a short 0.01% small-signal settling 
time equal to Tg seconds can be explained as follows: 1) design 
a simple cascode amplifier with a GBW [Hz] of 10/(27rTs) and 
a phase margin higher than 70°; 2) find the total capacitance 
seen at the gate of M 2 , i.e., Cqd2 + CgS2- This is the load 
capacitance of the FA; 3) design a FA with a bandwidth equal to 
(or even slightly greater than) the bandwidth of the main cascode 
amplifier (the exact optimum bandwidth for the FA should be 
determined from Fig. 5 based on the exact values of Pxn an d 
Pf2u)\ 3) push the second pole of the FA to a frequency greater 
than 5 times the GBW. That is, the PM of the FA should be 
more than 80°. 

It should be noted that, in general, the load capacitance of the 
FA i.e., Cgd2 + Cgs2 is much smaller than that of the main 
cascode amplifier, and since the bandwidths of the FA and the 
main cascode amplifier should be almost equal, in an optimum 
designed GBCA, the FA amplifier consumes much less power 



H(S) 
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Fig. 6. Whole schematic of the designed high-speed low-voltage folded 
cascode amplifier (bias circuit is not depicted). 



TABLE I 

Summary of Designed Amplifier Specifications With C l 
1.8 V Power Supply Voltage 



1 pF AND 



Gain 


80dB 


GBW 


660 MHz 


Phase Margin 


73° 


0.1% Settling Time 


1.7 nS 


0.01% Settling Time 


2.2 nS 


Slew Rate 


800 V/(iS 


Output Swing 


1.2 V 


Power dissipation 


3.8 mW 



compared with that of the main cascode amplifier. Due to the 
same reason, the g m of the FA is much less than that of the main 
cascode amplifier; hence, the input transistor of the FA is very 
smaller than M\ and it does not add a noticeable capacitance ton 
node X. Thus, after adding FA to the main cascode amplifier, the 
Px does not move noticeably. 



IV. Simulation Results 

Based on the described design procedure, we have designed a 
very high-speed and low-power folded cascode operational am- 
plifier in a 0. 1 8-//m CMOS process. The whole schematic of the 
amplifier is shown in Fig. 6. The amplifiers X and Y are feed- 
back amplifiers which also have folded cascode structure. The 
actual length and width of the transistors are selected by the de- 
sign procedure stated in [10]. Table I summarizes the specifica- 
tions of the amplifier. 

We have shown the step response of the amplifier in Fig. 7 
when the FA bandwidth lo u is optimum, is low, and is high. 
Fig. 7(a) clearly shows that when the bandwidths of the FAs 
are more than the optimum values, there is some ringing in 
the step response. If the bandwidths were increases more, the 
gain-boosting loops would be unstable. Fig. 7(b) shows the step 
response on an expanded scale to allow observation of 0.01% 
settling time. The figure obviously shows that when the band- 
widths of FAs are low the slew-settling component introduced 
in step response causes a very slow settling to 0.01 % error band. 




High co u 

Optimum co u 
Low co 




( b) Tim e M 

Fig. 7. (a) Step response of the designed amplifier with the optimum co u , a 
low oj u , and a high co u . (b) The step response in an expanded scale. 

V. Conclusion 
In this paper, a novel and generic model for GBCAs was de- 
veloped. A method for optimum design of these amplifiers for 
high-speed applications was presented. This method is suitable 
for all types of gain-boosted cascade amplifiers, especially in 
low-voltage and low-power applications. Finally, a very high- 
speed and low-voltage amplifier was presented. 
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